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COMBINATORIAL FLUORESCENCE ENERGY TRANSFER TAGS AND 

USES THEREOF 

5 

This application claims priority of Provisional 
Application no. 60/309,156, filed July 31, 2001 and 
is a continuation in part of U.S. Serial No. 
09/658,077, filed September 11, 2000, the contents of 
.0 both of which are hereby incorporated by reference 

into the subject application. 

Throughout this application, various publications are 
referenced in parentheses by author and year. Full 

.5 citations for these references may be found at the 

end of the specification immediately preceding the 
claims. The disclosures of these publications in 
their entireties are hereby incorporated by reference 
into this application to more fully describe the 

0 state of the art to which this invention pertains. 

Background Of The Invention 

The need to study many biological targets 
5 simultaneously drives the development of multiplex 

fluorescent tags. However, due to the limits of the 
spectral region, and therefore the availability of 
appropriate detectors, the number of available 
fluorescent dyes that have distinguishable emission 
0 spectra is limited to about ten. To overcome this 

limitation, a combinatorial fluorescent labeling 
approach for multi-color fluorescence in situ 
hybridization (M-FISH) has been developed and is now 
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widely used in the field of cytogenetics (Speicher et 
al., 1996; Schrock et al . , 1996). This approach mixes 
from two to seven individual fluorescent dyes that 
have unique emissions, and uses the fluorescence 
5 emission pattern to identify the different targets. 

The unique fluorescence emission pattern is achieved 
by mathematically combining the different dyes. This 
development has made possible advances in chromosome 
analyses. However, the procedure requires physically 

10 mixing the individual dyes in a quantitative manner 

to develop "unique" probe labels. This requirement, 
coupled with the potential interactions of the dyes, 
complicates the fluorescence emission patterns . 
Therefore, the major application of the technique is 

15 limited to methods that involve hybridization. 

Multiple lasers and detectors are also required for 
the imaging. A reagent kit that can be used to 
covalently label a wide range of biomolecules is 
difficult to construct with this approach. Thus, 

20 there is an urgent need for a large set of 

fluorescent tags that can be used for multiple 
component analyses in biomedical and other fields . 
Previously, the principle of fluorescent energy 
transfer (ET) was used to enhance fluorescence 

25 emission for the successful development of four ET 

tags for deoxyribonucleic acid (DNA) sequencing which 
are widely used in the Human Genome Project (Ju et 
al . 1995, 1996). Tags containing fluorophores in 
energy transfer relationships have been disclosed in 

30 U.S. Patent 6,028,190. 
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Summary Of The Invention 



This invention provides a composition of matter 
5 comprising multiple fluorophores, each of which is 

bound to a molecular scaffold at a separate 
predetermined position on the scaffold, such separate 
predetermined positions being selected so as to 
permit fluorescence energy transfer between one such 
10 fluorophore and another such fluorophore, wherein the 

one such fluorophore and the another such fluorophore 
are characterized by the maximum emission wavelength 
of one being greater than the minimum excitation 
wavelength of the other. 

15 

This invention further provides the instant 
composition of matter comprising two f luorophores , 
each of which is bound to a molecular scaffold, at a 
separate predetermined position on the scaffold, such 

20 separate positions being selected so as to permit 

fluorescence energy transfer between such 
f luorophores, and such fluorophores being 

characterized by the maximum emission wavelength of 
one of the fluorophores being greater than the 

25 minimum excitation wavelength of the other 

fluorophore . 

This invention further provides the instant 
composition of matter comprising three fluorophores 
30 each of which is bound to a molecular scaffold at- a 

separate predetermined position on the scaffold, such 
separate predetermined positions being selected so as 
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to permit fluorescence energy transfer among such 
fluorophores and such fluorophores being 

characterized by the maximum emission wavelength of 
one such fluorophore being greater than the minimum 
5 excitation wavelength of the second such fluorophore 

and the maximum emission wavelength of such second 
fluorophore being greater than the minimum excitation 
wavelength of the third such fluorophore. 



10 This invention further provides the instant 

composition of matter, wherein each fluorophore is 
covalently bound to the molecular scaffold. 

This invention further provides the instant 
15 composition of matter, wherein the efficiency of the 

fluorescence energy transfer is less than 20%. 



This invention further provides the instant 
composition of matter, wherein the molecular scaffold 
20 is rigid. 



25 



This invention further provides the instant 

composition of matter, wherein the molecular scaffold 
is polymeric. 

This invention further provides the instant 

composition of matter, wherein the molecular scaffold 
comprises a nucleic acid. 



30 This invention further provides the instant 

composition of matter, wherein the molecular scaffold 
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comprises a peptide. 

This invention further provides the instant 
composition of matter, wherein the molecular scaffold 
5 comprises a polyphosphate. 

This invention further provides the instant 
composition of matter, wherein at least one 
fluorophore is a fluorescent dye. 

10 

This invention further provides the instant 
composition of matter, wherein the fluorescent dye is 
6-carboxyf luorescein . 

15 This invention further provides the instant 

composition of matter, wherein the fluorescent dye is 
N , N , N 1 , N 1 -tetramethyl-6-carboxyrhodamine . 

This invention further provides the instant 
20 composition of matter, wherein the fluorescent dye is 

cyanine-5 monof unctional dye. 

This invention further provides the instant 
composition of matter, wherein at least one 
25 fluorophore is a luminescent molecule. 

This invention further provides the instant 
composition of matter, wherein at least one 
fluorophore is a quantum dot. 
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This invention also provides a composition of matter 
having the structure: 



FAM 



TAM 

Q-T— (S) m — T -R 



10 wherein S represents a 1 1 , 2 9 -dideoxyribose 

phosphate moiety, m is an integer greater than 1 
and less than 100, each T represents a thymidine 
derivative, FAM represents 6-carboxyf luorescein 
derivative, TAM represents N, N, N' , N' -tetramethyl- 

15 6-carboxyrhodamine derivative, each solid line 

represents a covalent bond, R represents either a 
hydroxy or phosphate terminus and Q represents 
either a hydroxy or phosphate terminus, with the 
proviso that R and Q are different. 

20 

This invention further provides the instant 
composition of matter, wherein m is 4 . 

This invention further provides the instant 
25 composition of .matter, wherein m is 6. 



This invention further provides the instant 
composition of matter, wherein m is 9. 
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This invention further provides the instant 
composition of matter, wherein m is 13. 



5 This invention also provides a composition of matter 

having the structure: 



FAM 



Cy5 



I 

Q-T — (S) — T-R 



wherein S represents a 1 1 , 2 1 -dideoxyribose 
phosphate moiety/ m is an integer greater than 1 

15 and less than 100, T represents a thymidine 

derivative, FAM represents a 6-carboxyf luorescein 
derivative, Cy5 represents a cyanine-5 
monof unctional dye derivative, each solid line 
represents a covalent bond, R represents either a 

20 hydroxy or phosphate terminus and Q represents 

either a hydroxy or phosphate terminus, with the 
proviso that R and Q are different. 



This invention further provides the instant 
25 composition of matter, wherein m is 4 . 

This invention further provides the instant 
composition of matter, wherein m is 5- 
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This invention further provides the instant 
composition of matter, wherein m is 7 . 



This invention further provides the 
composition of matter, wherein m is 10. . 



instant 



10 



This invention further provides the 
composition of matter, wherein m is 13. 



instant 



This invention also provides a composition of matter 
comprising the structure shown below: 



FAM 



TAM 



Cy5 



Q- T— (S) m — T— (S) n — T-R 



wherein S represents a 1 ' , 2 1 -dideoxyribose 
20 phosphate moiety, m is an integer greater than 1 

and less than 100, n is an integer greater than 1' 
and less than 100, T represents a thymidine 
derivative, FAM represents a 6-carboxyf luorescein 
derivative, Cy5 represents a cyanine-5 
25 monof unctional dye derivative, TAM represents a 

N, N, N' , N r -tetramethyl-6-carboxyrhodamine 
derivative, each solid line represents a covalent 
bond, R represents either a hydroxy or phosphate 
terminus and Q represents either a hydroxy or 
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phosphate terminus, with the proviso that R and Q 
are different. 

This invention further provides the instant 

5 composition of matter, wherein m is 3, and n is 7 . 

This invention further provides the instant 

composition of matter, wherein m is 4, and n is 6. 

10 This invention further provides the instant 

composition of matter, wherein m is 5, and n is 5 

This invention further provides the instant 

composition of matter, wherein m is 6, and n is 6. 

15 

This invention further provides the instant 

composition of matter, wherein m is 7, and n is 7 . 

This invention also provides a composition of matter 
20 comprising the structure shown below: 



TAM 



TAM 



25 



Q-T-(S) m -T-R 
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wherein S represents a 1 1 , 2 1 -dideoxyribose 
phosphate moiety, m represents an integer greater 
than 1 and less than 100, T represents a 
thymidine derivative, and TAM represents a 
5 N, N, N' , N' -tetramethyl-6-carboxyrhodamine 

derivative, each solid line represents a covalent 
bond, R represents either a hydroxy or phosphate 
terminus and Q represents either a hydroxy or 
phosphate terminus, with the proviso that R and Q 
10 are different. 

This invention further provides the instant 
composition of matter, wherein m is 4 . 

15 This invention also, provides a nucleic acid labeled 

with any of the instant compositions. 

This invention provides any of the instant 
compositions, wherein the nucleic acid is DNA. 

20 

This invention provides any of the instant 
compositions, wherein the nucleic acid is RNA. 

This invention provides any .of the instant 
25 compositions, wherein the nucleic acid is DNA /RNA - 



This invention also provides a method of determining 
whether a preselected nucleotide residue is present 
at a predetermined position within a nucleic acid 
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comprising the steps of: 

(a) contacting the nucleic acid, under 
hybridizing and DNA ligation-permitting 
conditions, with (i) a DNA ligase, (ii) a 
5 first oligonucleotide having affixed 

thereto a composition of matter of claim 1 
wherein the first oligonucleotide 

hybridizes with nucleotides immediately 
adjacent one side of the predetermined 

10 position and (iii) a second oligonucleotide 

which hybridizes with the nucleotides 
immediately adjacent the other side of the 
predetermined position, wherein the 
hydroxy-terminal residue of the 

15 oligonucleotide which hybridizes to the 

nucleotide located 3' of the predetermined 
position is a nucleotide which is 
complementary to the preselected nucleotide 
residue; and 

20 (b) detecting the presence of a ligation 

product comprising both the first and the 
second oligonucleotides, the presence of 
such a ligation product indicating the 
presence of the preselected nucleotide 

25 residue at the predetermined position. 



This invention further provides a method of 
determining whether at various predetermined 
positions within a nucleic acid, a preselected 
30 nucleotide residue is present at such position, 

wherein the preselected nucleotide residue may vary 
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at different predetermined positions which comprises 
determining whether each preselected nucleotide is 
present each predetermined position according to the 
instant method. 

5 

This invention provides the instant method, wherein 
the presence of a plurality of given nucleotide 
residues is determined simultaneously. 

10 This invention further provides the instant method, 

wherein the DNA ligase is Taq DNA ligase. 

This invention further provides the instant method, 
wherein the second oligonucleotide has an isolation- 
15 permitting moiety affixed thereto, and wherein the 

method further comprises the steps of isolating the 
moiety-containing molecules resulting from step (a) 
and determining the presence therein of ligated first 
and second oligonucleotides. 

20 

This invention further provides the instant method, 
wherein the composition of matter affixed to the 
first oligonucleotide has a predetermined emission 
spectrum, and wherein the observation of this 
25 emission spectrum is employed to determine the 

presence of ligated first and second oligonucleotides 
in step (b) . 



30 



This invention also provides a method of 
whether a. preselected nucleotide residue 



determining 
is present 
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at a predetermined position within a nucleic acid 
comprising the steps of: 

(a) contacting the nucleic acid, under 
hybridizing and DNA polymerization-permitting 
5 conditions, with (i) a DNA polymerase, (ii) an 

oligonucleotide (1) having affixed thereto a 
composition of matter of claim 1, and (2) having 
a hydroxyl 3 1 terminus thereof, wherein the 
oligonucleotide hybridizes with the 3' region of 

10 the nucleic acid molecule flanking the 

predetermined position, and (iii) a 

dideoxynucleotide labeled with an isolation- 
permitting moiety, wherein the labeled 
dideoxynucleotide is complementary to the given 

15 nucleotide residue, 

with the proviso that upon hybridization of the 
oligonucleotide with the nucleic acid in the 
presence of DNA polymerase and the preselected 
nucleotide residue, the oligonucleotide and 
20 dideoxynucleotide are juxtaposed so as to permit 

their covalent linkage by the DNA polymerase; 

(b) detecting the presence of a polymerization 
product comprising both the oligonucleotide and 
the dideoxynucleotide, the presence of such a 
25 polymerization product indicating the presence 

of the preselected nucleotide residue at the 
predetermined position. 

This invention further provides a method of 
30 determining whether at various predetermined 

positions within a nucleic acid, a preselected 
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nucleotide residue is present at such position, 
wherein the preselected nucleotide residue may vary 
at different predetermined positions which comprises 
determining whether each preselected nucleotide is 
5 present each predetermined position according to the 

instant method. 

This invention further provides the instant method, 
wherein the DNA polymerase is thermo sequenase. 

This invention further provides the instant method, 
wherein the dideoxynucleotide is selected from the 
group consisting of dideoxyadenosine triphosphate, 
dideoxycytidine triphosphate, dideoxyguanosine 

triphosphate, dideoxythymidine triphosphate, and 
dideoxyuridine triphosphate . 

This invention further provides the instant method, 
wherein the composition of matter affixed to the 
20 oligonucleotide has a predetermined emission 

spectrum, and wherein the observation of this 
emission spectrum is employed to determine the 
presence of polymerization product in step (b) . 

25 This invention further provides the instant methods, 

wherein observing the predetermined emission spectrum 
is performed using radiation having a wavelength of 
between 200 and lOOOnm. 



10 



15 



30 



This invention further- provides the instant methods, 



WO 02/22883 



15 



PCT/US01/28967 



wherein the radiation has a wavelength of 488 nm. 

This invention further provides the instant methods, 
wherein observing the predetermined emission spectrum 
is performed using radiation having a bandwidth of 
between 1 and 50nm. 

This invention further provides the instant methods, 
wherein the radiation bandwidth is lnm. 

This invention further provides the instant methods, 
wherein the isolation-permitting moiety comprises 
biotin, streptavidin, phenylboronic acid, 

salicylhydroxamic acid, an antibody or an antigen. 

This invention further provides the instant methods, 
wherein the isolation-permitting moiety is attached 
to the oligonucleotide via a linker molecule. 

This invention further provides the instant methods, 
wherein the isolation-permitting moiety is attached 
to the dideoxynucleotide via a linker molecule. 

This invention further provides the instant methods, 
wherein the linker molecule is chemically cleavable. 

This invention further provides the instant methods, 
wherein the linker molecule is photocleavable . 
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This invention further provides the instant methods, 
wherein the linker molecule has the structure: 
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Brief Description Of The Figures 



Figure 1A-B: (A) Schematic of a multi-chromophore 
assembly connected to a linker. In general, 1 to n 
5 chromophores can be attached to the assembly with the 

chromophores separated by spacers as shown. 
Chromophores can be, but not limited to, fluorescent 
dyes, quantum dots or luminescent molecules such as 
terbium chelate. A variety of spacers such as 

10 nucleotides, peptides, a polymer linker formed by 1' , 

2 ' -dideoxyribose phosphates or other chemical 
moieties can be used. The assembly label shown here 
is connected to a linker which can be designed as 
nucleic acids, proteins or cells, etc for multiplex 

15 biological assays. (B) The synthesis of F-4-T-6-C. 

The numbers in F-4-T-6-C refer to the number of 
spacing nucleotides in the scaffold between dyes F 
and T, and T and C. F = Fam; T = Tarn; C = Cyanine 5 
monof unctional dye. 

20 

Figure 2A-D: Spectroscopic data for tags F-4-T-6-C 
and F-7-T-3-C. 

(A) Two tags with different fluorescent signatures 
have been constructed by varying the spacing 

25 between the three dyes F, T, and C. 

(B) Ultraviolet/visible (UV/vis) absorption spectrum 
of dye F-4-T-6-C. 

(C) Fluorescence emission spectra of dye F-4-T-6-C. 

(D) Fluorescence emission spectra of dye F-7-T-3-C. 
3 0 F = Fam; T = Tarn; C = Cy5 . 



Figure 3A-B : 



Schematic labeling approach to 
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construct CFET-primers and CFET-dUTPs . The spacer 
between dyes is 1' , 2' -dideoxyribose phosphate (S) in 
(A) and proline (P) in (B) . "m" and "n" refer to the 
number of molecules in the spacer. dUTP 
5 deoxyuridine triphosphate. 

* Figure 4: The synthesis of CFET-dUTP. The CFET tag 
comprises three different fluorescent dyes: Fam, Tam 
and Cy5. 

10 

Figure 5 : Structures of Aminoallyl (AA) -dUTP, Fam- 
proline, and N-Hydroxy succinimide (NHS) esters of 
TAM and Cy5 . 

15 Figure 6 : Synthetic schemes to prepare Fam-proline, 

Azido-proline and Cy5-phosphine . TMSCI = 

trimethylsilyl chloride . 

Figure 7 : The eight unique fluorescence signatures of 
20 CFET tags generated in a three-color CAE system- FAM 

channel (520 ± 20 nm, dotted line), TAM channel (585 
± 20 nm, solid thin line), Cy5 channel (670 ± 20 nm, 
solid thick line) . The digital ratio denoting the 
fluorescence signature for each CFET tag from the 
25 three channels [dotted : thin : thick] is shown in the 

brackets. The fluorescence signatures in the 
electropherogram were obtained by excitation at 488 
nm and electrokinetic injection of the eight CFET- 
labeled oligonucleotides into the three-color CAE 
30 system. 
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Figure 8A-B : Schematic of using iigase chain reaction 
for determining the genotype at a locus containing a 
possible single-base mutation. 

(A) Primer pairs are generated surrounding a base 
5 that can be mutated. The wild-type primer is 

labeled with one CFET tag (Tag 1) and the 
mutation-specific primer with another CFET tag 
(Tag 2) . 

(B) Subsequent gel electrophoresis allows separation 
10 of ligated primer pairs and unincorporated 

primers. Different bands appear, on the gel 
depending on whether the template is wild-type 
or mutated. 

15 Figure 9 : Schematic of expected results from 

screening four potential mutation sites of Rbl gene 
using eight unique CFET Tags and the ligase chain 
reaction assay. Only ligation products are shown on 
the gels. 

20 

Figure 10 : Schematic of chromosomal studies to detect 
macrodeletions and amplifications. 

Figure 11 : This figure schematically shows the 
25 procedure for multiplex SNP detection through the 

ligation of hybridized CFET-labeled and biotinylated 
oligonucleotides. Taq DNA ligase seals the nick 
between the two hybridized oligonucleotides if the 
nucleotides at the ligating junction are correctly 
30 base-paired to the template (A to T; C to G) . CFET- 

labeled, biotinylated ligation products are then 
isolated using streptavidin-coated magnetic beads. 
After washing and releasing from the magnetic beads, 
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the ligation products are electrokinetically injected 
into a three-color CAE system. Each CFET-labeled 
ligation product, which identifies a unique SNP, is 
unambiguously detected due to its distinct mobility 
5 and fluorescence signature in the CAE 

electropherogram. 

Figure 12A-B : Electropherogram of CFET-labeled 
ligation products for SNPs identification on exon 20 

10 of the RBI. (A) Detection of six nucleotide 

variations from synthetic DNA templates. FAM ( T) and 
F-10-Cy5 ( T) peaks are obtained from two different 
locations of the same template. F-9-T (C) and F-13-T 
( T) peaks indicate mutations from the same locus of a 

15 DNA template, while F-4-T-6-Cy5 {A) and F-7-T-7-Cy5 

(C) peaks identify mutations from the same locus of 
another DNA template. (B) Detection of three 
homozygous genotypes (T , C and A) from a PCR product 
of RBI. 

20 

Figure 13 : This figure is a schematic of single base 
primer extension for multiplex SNP detection by using 
dye-labeled primers and biotinylated 

dideoxynucleoside triphosphates (ddNTP-Biotin) . DNA 

25 template containing polymorphic sites is incubated 

with a dye-labeled primer, hybridizing the template 
adjacent to the polymorphic site, ddNTP-Biotin and 
thermo sequenase. At the end of reaction and 
purification the primer extension products are 

30 analyzed for fluorescence signatures. 



Figure 14 : Three unique fluorescence signatures 
generated from dye-labeled extension products. FAM 
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channel (light) and TAM channel (Dark) . The 
fluorescence signatures in the electropherograms were 
obtained by excitation at 488nm and the single base 
extension of the dye-labeled primers. The digital 
5 ratio denoting the fluorescence signature for each 

from the two detection channels is shown in 
parentheses . 

Figure 15A-C : The electropherograms of CFET-labeled 
10 primer extension products for multiplex SNPs 

identification on the mimic of exon 20 of the RBI. 
FAM channel (Light line) and TAM channel (Dark line) . 
(A) : Detection of two individual homozygous genotypes 
from a wild type template. FAM (T) and F-9-T (C) 
15 peaks were obtained from two different loci on the 

template. (B) : Similar to (A) except a mutated 
template was used. (C) : Simultaneous detection of 
three nucleotide variations. FAM (T) peak was 
obtained from a locus of the template where a 
20 homozygous genotype was found. F-9-T (C) and F-13-T 

(T) peaks indicate the mutation R661W (heterozygote ) 
from the same locus of a DNA template. 

Figure 16 : Schematic of a high throughput channel 
25 based, moiety-based purification system. Sample 

solutions can be pushed back and forth between the 
two plates through glass capillaries and the coated 
channels in the chip, the channels being coated with 
an appropriate chemical to bind the moiety tag on the 
30 samples, e.g. streptavidin coating in the case of 

biotinylated oligonucleotides. Where the moieties are 
attached by cleavable linkers, e.g. photocleavable 
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linkers, the whole chip can be irradiated to cleave 
the samples after immobilization. 
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Detailed Description Of The Invention 



Definitions 

The following definitions are offered as an aid to 
understanding the invention: 



10 



15 



CAE 
CFET 

Cy 5 

ddNTP 

FAM 

nm 

RBI 

SNP 

TAM 



Capillary Array Electrophoresis 
Combinatorial fluorescence energy 
transfer; 

Cyanine 5 monof unctional dye; 
Dideoxynucleotide trisphosphate ; 
6-carboxyf luorescein; 
nanometer 

Retinoblastoma gene ; 
Single nucleotide polymorphism; 
N,N,N' , N' -tetramethyl-6-carboxy 
rhodamine . 



As used herein, and unless stated otherwise, each of 
20 the following terms shall have the definition set 

forth below. 



"Chemically cleavable" shall mean cleavable by any 
chemical means including but not limited to pH and 
25 temperature. 

" DNA/RNA" shall mean a nucleic acid molecule 
comprising both deoxyribonucleotides and 

ribonucleotides . 

30 

"Emission spectrum" shall mean the amplitude and 
frequency of energy emitted from a composition of 
matter as a result of exciting radiation thereon. 
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"Flexible", when used to describe a molecular 
scaffold, shall mean that the distance between the 
centers of any pair of fluorophores covalently bound 
5 to the scaffold varies by more than 50%. 



"Fluorescence energy transfer" shall mean the 
transfer of energy between two fluorophores via a 
dipole-dipole interaction . 

10 

"Fluorescent dye" shall mean an organic dye molecule 
capable of emitting fluorescent energy of wavelength 
between 200 and lOOOnm when excited by an energy of 
shorter wavelength wherein the emitted energy results 
15 from a singlet to singlet transition. Examples are 6- 

carboxyf luorescein, N , N , N' , N' -tetramethyl-6- 

carboxyrhodamine, and cyanine-5 monof unctional dye. 



" Fluorophore" shall mean a molecule, such as a 
20 fluorescent dye, quantum dot or luminescent molecule, 

capable of emitting energy of wavelength between 400 
and lOOOnm when excited by an energy of shorter 
wavelength than the corresponding emission 
wavelength. Examples of fluorophores include 6- 
2 5 carboxy fluorescein, N, N, N 1 , N' -tetramethyl-6-carboxy 

rhodamine, cyanine-5 monof unctional dye, zinc 
sulf ide-capped cadmium selenide quantum dots, and 
lanthanide chelates . 



30 "Hybridize" shall mean the annealing of one single- 

stranded nucleic acid molecule to another single 
stranded nucleic acid molecule based on sequence 
complementarity. The propensity for hybridization 
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10 



15 



20 



25 



between nucleic acids depends on the temperature and 
ionic strength of their milieu, the length of the 
nucleic acids and the degree of complementarity. The 
effect of these parameters on hybridization is well 
known in the art (see Sambrook, 1989) . 

"Isolation-permitting moieties" shall iiiclude without 
limitation biotin or streptavidin which bind to one 
another, antibodies or antigens which bind to one 
another, phenylboronic acid or salicylhydroxamic acid 
which bind to one another. 

"Ligation-permitting conditions" include without 
limitation conditions of temperature, ionic strength, 
ionic composition, molecular composition, orientation 
and viscosity that allow one oligonucleotide to be 
joined enzymatically to another via a phosphodiester 
bond . 

"Ligation" shall mean the enzymatic covalent joining 
of a nucleic acid with either another nucleic acid or 
a single nucleotide. 

"Linker molecule" shall mean a chemical group used to 
covalently join two other molecules. An example of a 
linker molecule is the structure given below: 



H 
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"Luminescent molecule" shall mean a molecule capable 
of emitting energy of wavelength between 200 and 
lOOOnm when excited by energy of shorter wavelength 
than the corresponding emission wavelength, wherein 
5 the emitted energy does not result from a singlet to 

singlet transition. Examples of luminescent molecules 
include europium polycarboxylate chelate and terbium 
chelates. 

"Molecular scaffold" shall mean a molecular structure 
to which two or more fluorophores can be, and/or are, 
covalently bound at discrete loci thereon. Ideally, a 
molecular scaffold is polymeric, comprising monomeric 
units to which fluorophores can be bound. The 
monomeric units which make up such polymeric scaffold 
can, but need not be> identical. Examples of such 
monomeric units include 1 ' , 2 ' -dideoxyribose phosphate 
and thymidine. 

20 "Nucleic acid molecule" shall mean any nucleic acid 

molecule, including, without limitation, DNA, RNA and 
hybrids thereof. The nucleic acid bases that form 
nucleic acid molecules can be the bases A, C, G, T 
and U, as well as derivatives thereof. Derivatives of 

25 these bases are well known in the art, and are 

exemplified in PCR Systems, Reagents and Consumables 
(Perkin Elmer Catalogue 1996-1997, Roche Molecular 
Systems, Inc., Branchburg, New Jersey, USA). 



10 



15 



30 "Oligonucleotide" shall mean a nucleic acid 

comprising two or more nucleotides. 
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"Photocleavable" shall mean cleavable by 

electromagnetic energy of between 200 and lOOOnm 
wavelength . 

5 "Polymeric" shall describe a molecule composed of 

more than two monomeric units. 

"Quantum dot" shall mean a nanometer-sized 
composition of matter comprising a semi-conductor or 
10 metal, wherein such composition is capable of 

luminescence. Examples of quantum dots include zinc- 
sulf ide-capped cadmium selenide quantum dots. 

"Rigid", when used to describe a molecular scaffold, 
15 shall mean that the distance between the centers of 

any pair of fluorophores covalently bound to the 
scaffold does not vary more than 50%. 

Embodiments of the Invention 

20 

This invention provides a composition of matter 
comprising multiple fluorophores, each of which is 
bound to a molecular scaffold at a separate 
predetermined position on the scaffold, such separate 

25 predetermined positions being selected so as to 

permit fluorescence energy transfer between one such 
fluorophore and another such fluorophore, wherein the 
one such fluorophore and the another such fluorophore 
are characterized by the maximum emission wavelength 

30 of one being greater than the minimum excitation 

wavelength of the other. 



This invention further provides the instant 



WO 02/22883 



28 



PCT/US01/28967 



composition of matter comprising two f luorophores , 
each of which is bound to a molecular scaffold, at a 
separate predetermined position on the scaffold, such 
separate positions being selected so as to permit 
5 fluorescence energy transfer between such 

f luorophores, and such fluorophores being 

characterized by the maximum emission wavelength of 
one of the fluorophores being greater than the 
minimum excitation wavelength of the other 
10 fluorophore. 



This invention further provides the instant 
composition of matter comprising three fluorophores 
each of which is bound to a molecular scaffold at a 

15 separate predetermined position on the scaffold, such 

separate predetermined positions being selected so as 
to permit fluorescence energy transfer among such 
fluorophores and such fluorophores being 

characterized by the maximum emission wavelength of 

20 one such fluorophore being greater than the minimum 

excitation wavelength of the second such fluorophore 
and the maximum emission wavelength of such second 
fluorophore being greater than the minimum excitation 
wavelength of the third such fluorophore. 

25 

In one embodiment each fluorophore is covalently 
bound to the molecular scaffold. 



In one embodiment the efficiency of the fluorescence 
30 energy transfer is less than 20%. 
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In one embodiment the molecular scaffold is rigid. 

In one embodiment the molecular scaffold i 
polymeric . 

5 

In one embodiment the molecular scaffold comprises 
nucleic acid. 

In one embodiment the molecular scaffold comprises 
10 peptide. 

In one embodiment the molecular scaffold comprises 
polyphosphate . 

15 In one embodiment at least one fluorophore is 

fluorescent dye. 

In one embodiment the fluorescent dye is 6 
carboxyf luorescein . 

20 

In one embodiment the fluorescent dye is N/N/N'/N 1 
tetramethyl-6-carboxyrhodamine . 

In one embodiment the fluorescent dye is cyanine- 
25 monof unctional dye. 

In one embodiment at least one fluorophore is 
luminescent molecule. 

30 In one embodiment at least one fluorophore is 

quantum dot. 
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This, invention also provides a composition of matter 
having the structure: 



FAM TAM 
Q-T-(S) m -T -R 



wherein S represents a 1 1 , 2 1 -dideoxyribose 
10 phosphate moiety, m is an integer greater than 1 

and less than 100, each T represents a thymidine 
derivative, FAM represents 6-carboxyf luorescein 
derivative, TAM represents N , N, N' , N' -tetramethyl- 
6-carboxyrhodamine derivative, each solid line 
15 represents a covalent bond, R represents either a 

hydroxy or phosphate terminus and Q represents 
either a hydroxy or phosphate terminus, with the 
proviso that R and Q are different. 

20 In one embodiment m is 4 . In one embodiment m is 6. 

In one embodiment m is 9. In one embodiment m is 13. 



25 



This invention also provides a composition of matter 
having the structure: 



3C 



FAM 



Cy5 

Q- T — (S) m — T-R 



wherein S represents a 1 1 , 2 1 -dideoxyribose 
phosphate moiety, m is an integer greater than 1 
and less than 100, T represents a thymidine 
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derivative, FAM represents a 6-carboxyf luorescein 
derivative, Cy5 represents a cyanine-5 
monof unctional dye derivative, each solid line 
represents a covalent bond, R represents either a 
5 hydroxy or phosphate terminus and Q represents 

either a hydroxy or phosphate terminus, with the 
proviso that R and Q are different. 

In one embodiment m is 4. In one embodiment m is 5. 
10 In one embodiment n m is 7. In one embodiment m is 

10. In one embodiment m is 13. 

This invention, also provides a composition of matter 
comprising the structure shown below: 

15 



20 



FAM 



TAM 



Cy5 



Q- T— (S) m — T— (S) — T-R 



wherein S represents a 1 ' , 2 1 -dideoxyribose 
phosphate moiety, m is an integer greater than 1 
and less than 100, n is an integer greater than 1 
and less than 100, T represents a thymidine 

25 derivative, FAM represents a 6-carboxyf luorescein 

derivative, Cy5 represents a cyanine-5 
monof unctional dye derivative, TAM represents a 
N, N, N' , N' -tetramethyl-6-carboxyrhodamine 
derivative, each solid line represents a covalent 

30 bond, R represents either a hydroxy or phosphate 

terminus and Q represents either a hydroxy or 
phosphate terminus, with the proviso that 'R and Q 
are different. 
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In one embodiment m is 3, and n is 7. In one 
embodiment, wherein m is A, and n is 6. In one 
embodiment m is 5, and n is 5. In one embodiment m is 
5 6, and n is 6. In one embodiment m is 7, and n is 7. 

This invention also provides a composition of matter 
comprising the structure shown below: 



10 



TAM 



TAM 

T 1 



Q-T-(S) m -T-R 



15 



20 



25 



wherein S represents a 1 1 , 2 1 -dideoxyribose 
phosphate moiety, m represents an integer greater 
than 1 and less than 100, T represents a 
thymidine derivative, and TAM represents a 
N, N, N' , N' -tetramethyl-6-carboxyrhodamine 
derivative, each solid line represents a covalent 
bond, R represents either a hydroxy or phosphate 
terminus and Q represents either a hydroxy or 
phosphate terminus, with the proviso that R and Q 
are different. 



In one embodiment m is 4 . 

This invention also provides a nucleic acid labeled 
30 with any of the instant compositions. 



In one embodiment the nucleic acid is DNA. 
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In one embodiment the nucleic acid is RNA. 

In one embodiment the nucleic acid is DNA/RNA. 

5 This invention also provides a method of determining 

whether a preselected nucleotide residue is present 
at a predetermined position within a nucleic acid 
comprising the steps of: 

(a) contacting the nucleic acid, under 
!0 hybridizing and DNA ligation-permitting 

conditions, with (i) a DNA ligase, (ii) a 
first oligonucleotide having affixed 
thereto a composition of matter of claim 1 
wherein the first oligonucleotide 

15 hybridizes with nucleotides immediately 

adjacent one side of the predetermined 
position and (iii) a second oligonucleotide 
which hybridizes with the nucleotides 
immediately adjacent the other side of the 

20 predetermined position, wherein the 

hydroxy-terminal residue of the 

oligonucleotide which hybridizes to the 
nucleotide located 3' of the predetermined 
position is a nucleotide which is 

25 complementary to the preselected nucleotide 

residue; and 

(b) detecting the presence of a ligation, 
product comprising both the first and the 
second oligonucleotides, the presence of 

30 such a ligation product indicating the 

presence of the preselected nucleotide 
residue at the predetermined position. 
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This invention further provides a method of 
determining whether at various predetermined 
positions within a nucleic acid, a preselected 
nucleotide residue is present at such position, 
5 wherein the preselected nucleotide residue may vary 

at different predetermined positions which comprises 
determining whether each preselected nucleotide is 
present each predetermined position according to the 
instant method. 

0 

In one embodiment the presence of a plurality of 
given nucleotide residues is determined 

simultaneously - 



15 In one embodiment the DNA ligase is Taq DNA ligase. 

This invention further provides the instant method, 
wherein the second oligonucleotide has an isolation- 
permitting moiety affixed thereto, and wherein the 
20 method further comprises the steps of isolating the 

moiety-containing molecules resulting from step (a) 
and determining the presence therein of ligated first 
and second oligonucleotides . 



25 This invention further provides the instant method, 

wherein the composition of matter affixed to the 
first oligonucleotide has a predetermined emission 
spectrum, and wherein the observation of this 
emission spectrum is employed to determine the 

30 presence of ligated first and second oligonucleotides 

in step (b) . 
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This invention also provides a method of determining 
whether a preselected nucleotide residue is present 
at a predetermined position within a nucleic acid 
comprising the steps of: 
5 (a) contacting the nucleic acid, under 

hybridizing and DNA polymerization-permitting 
conditions, with (i) a DNA polymerase, (ii) an 
oligonucleotide (1) having affixed thereto a 
composition of matter of claim 1, and (2) having 
10 a hydroxyl 3 f terminus thereof, wherein the 

oligonucleotide hybridizes with the 3' region of 
the nucleic acid molecule flanking the 
predetermined position, and (iii) a 

dideoxynucleotide labeled with an isolation- 
15 permitting moiety, wherein the labeled 

dideoxynucleotide is complementary to the given 
nucleotide residue , 

with the proviso that upon hybridization of the 
oligonucleotide with the nucleic acid in the 
20 presence of DNA polymerase and the preselected 

nucleotide residue, the oligonucleotide and 
dideoxynucleotide are juxtaposed so as to permit 
their covalent linkage by the DNA polymerase; 
(b) detecting the presence of a polymerization 
25 product comprising both the oligonucleotide and 

the dideoxynucleotide, the presence of such a 
polymerization product indicating the presence 
of the preselected nucleotide residue at the 
predetermined position. 

30 

This invention further provides a method of 
determining whether at various predetermined 
positions within a nucleic acid, a preselected 
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nucleotide residue is present at such position, 
wherein the preselected nucleotide residue may vary 
at different predetermined positions which comprises 
determining whether each preselected nucleotide is 
5 present each predetermined position according to the 

instant method. 

In one embodiment the DNA polymerase is thermo 
sequenase . 

This invention further provides the instant method, * 
wherein the dideoxynucleotide is selected from the 
group consisting of dideoxyadenosine triphosphate, 
dideoxycytidine triphosphate, dideoxyguanosine 

triphosphate, dideoxythymidine triphosphate, and 
dideoxyuridine triphosphate - 

This invention further provides the instant method, 
wherein the composition of matter affixed to the 
20 oligonucleotide has a predetermined emission 

spectrum, and wherein the observation of this 
emission spectrum is employed to determine the 
presence of polymerization product in step (b) . 

25 This invention further provides the instant methods, 

wherein observing the predetermined emission spectrum 
is performed using radiation having a wavelength of 
between 200 and lOOOnm. 



10 



15 



30 In one embodiment the radiation has a wavelength of 

488 run . 
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10 



15 



20 



25 



This invention further provides the instant methods, 
wherein observing the predetermined emission spectrum 
is performed using radiation having a bandwidth of 
between 1 and 50nm. 

In one embodiment the radiation bandwidth is Inm. 

This invention further provides the instant methods, 
wherein the isola.tion-permitting moiety comprises 
biotin, streptavidin, phenylboronic acid, 

salicylhydroxamic acid, an antibody or an antigen. 

This invention further provides the instant methods, 
wherein the isolation-permitting moiety is attached 
to the oligonucleotide via a linker molecule. 

This invention further provides the instant methods, 
wherein the isolation-permitting moiety is attached 
to the dideoxynucleotide via a linker molecule. 

This invention further provides the instant methods, 
wherein the linker molecule is chemically cleavable. 

This invention further provides the instant methods, 
wherein the linker molecule is photocleavable . 

This invention further provides the instant methods, ' 
wherein the linker molecule has the structure: 
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This invention will be better understood from the 
Experimental Details which follow. However, one 
skilled in the art will readily appreciate that the 
specific methods and results discussed are merely 
5 illustrative of the invention as described more fully 

in the claims which follow thereafter. 
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Experimental Details 

I . The Design of Combinatorial Fluorescence Energy 
Transfer Tags 

5 

Background: Optical interactions persist between two 
chromophores even when they are as far as 80 
angstroms apart. The chromophore with high energy 
absorption is defined as a donor, and the chromophore 

10 with lower energy absorption is defined as an 

acceptor. Fluorescence energy transfer is mediated 
by a dipole-dipole coupling between the chromophores 
that results in resonance transfer of excitation 
energy from an excited donor molecule to an acceptor 

15 (Forster, 1965) . Forster established that the energy 

transfer efficiency is proportional to the inverse of 
the sixth power of the distance between the two 
chromophores. Fluorescence resonance energy transfer 
has been used extensively as a spectroscopic ruler 

20 for biological structures (Stryer, 1978), and energy 

transfer- coupled tandem phycobiliprotein conjugates 
have found wide applications as unique fluorescent 
labels (Glazer and Stryer, 1983) . A set of 

polycationic heterodimeric fluorophores that exploit 

25 energy transfer and have high affinities for double- 

stranded DNA were also developed, offering advantages 
over monomeric fluorophores in multiplex fluorescence 
labeling applications (Benson et al., 1993; Rye et 
al., 1993). By exploiting fluorescence energy 

30 transfer principle, using a common donor and four 

different acceptors, four sets of ET primers and 
dideoxynucleotides were constructed that are markedly 
superior to single dye labels in DNA sequencing, and 
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in multiplex polymerase chain reaction (PCR) -based 
mapping and sizing protocols (Ju et al . , 1995, 1996). 

The present application discloses how energy transfer 
5 and combinatorial concepts can be used to tune the 

fluorescence emission signature of fluorescent tags 
for the development of a large number of 
combinatorial fluorescence energy transfer (CFET) 
tags. A schematic construction of the tags is shown 

10 in Figure la. Representative examples for the 

construction of the CFET tags and their expected 
fluorescence signatures are shown in Table 1. Three 
individual fluorescent dyes, 6-carboxyfluorescein 
( FAM or F) , N, N, N 1 , N 1 -tetramethyl-6-carboxyrhodamine 

15 (TAM or T) and Cyanine dye (Cy5 or C) are selected as 

examples to construct the CFET tags. The 
fluorescence emission maxima for FAM, TAM and Cy5 are 
525nm, 580nm and 670nm / respectively. Chemical 
moieties used as spacers are selected to construct 

20 various CFET tags aimed at conveniently labeling 

biomolecules and other targets of interest, monomers 
are convenient to employ. Other spacer moieties 
include nucleotides, peptides and 1' 2' -dideoxyribose 
phosphates. As shown in Table 1, tag 1 is constructed 

25 with FAM alone and displays its characteristic 

fluorescence signature (Xmax = 525 nm) . Any 
f luorophore with a characteristic fluorescence 
signature could be used in place of FAM. With FAM as 
a donor and TAM as an acceptor, CFET tags 2, 3, 4, 

30 and 5 can be constructed by changing the distance >X R" 

between the FAM and TAM chromophores . The rationale 
is that altering the distance between donor and 
acceptor changes the energy transfer efficiency, and 
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therefore the ratio of the fluorescence emission 
intensity of the donor (FAM) and acceptor (TAM) . 
Similarly, with FAM as a donor and Cy5 as an 
acceptor, CFET tags 6, 7 and 8 can be generated. 
5 With three dyes, with FAM as a donor, TAM as an 

acceptor for FAM and as a donor for 
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Table 1 


Representative Example of CFET Tags 




CFET Tag 


Fluorescence Signature 


Tag ID 


i ° 

F 


i 

525 nm 


1 


R=^I O 

F R T 


1 

525 nm 580 nm 


2 


F R T 


i 1 

525 nm 580 nm 


3 


F R T 


J I 

525 nm 580 nm 


4 


F R T 


J i 

525 nm 580 nm 


5 


F R C 


■ i 

525 nm 670 nm 


6 


R=?n — 0 

F C 


1 1 

525 nm 670 nm 


7 


F R C 


1 

1 1 

525 nm 670 nm 


8 


F Rl T ^ C 


i 1 1 

525 nm 580 nm 670 nm 


9 


F T ^ C 


J 1 i 

525 nm 580 nm 670 nm 


10 
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Cy5, which acts as the final acceptor, CFET tags 9 
and 10 can be constructed by manipulating distances 
N> R1" and "R2". All the CFET tags can be excited with 
a single laser source and analyzed by simple 
5 detectors capable of capturing the emission 

signatures from each tag. In other embodiments, more 
than three dyes can be used. Alternatively just 
single chromophores can be used as long as they have 
unique fluorescence signatures. 

10 

The donor and acceptor fluorescent molecules are 
separated using convenient chemical moieties as 
spacers to tune the fluorescence signatures of the 
CFET tags. Examples of such spacer moieties include 
15 nucleotides, dideoxyribose phosphate, and amino 

acids. The construction of CFET tags involving three 
or more different dyes is more challenging, since 
synthetic procedures need to be designed for 
introducing the individual dye molecules at specific 
20 locations on the spacing backbone. As an example, 

CFET tags involving three dyes can be constructed 
using oligonucleotides as spacers. An oligonucleotide 
with the sequence 5 ' -TTTTTTTTTTTTTTTTTTTTTTTTTC-3 ' 
(SEQ ID NO: 1) was selected as a scaffold to 
25 covalently attach FAM, TAM and Cy5 . FAM is 

introduced by using a 6-FAM-dT phosphoramidite, TAM 
is introduced by using TAM-dT (Glen Research, 
Sterling, VA) , and a modified T having an amino 
linker at the C-5 position (Glen Research) is 
30 incorporated into the oligonucleotide which is then 

linked to Cy5 - N-Hydroxy succinimide (NHS) ester. 
The final product is purified by size exclusion 
chromatography and gel electrophoresis. A 
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representative reaction for the construction of CFET 
tag. F-4-T-6-C (the numbers refer to the number of 
spacing nucleotides) involving FAM, TAM and Cy5 is 
shown in Figure 1. By changing the spacing between 
5 FAM and TAM, and TAM and Cy5, two CFET tags F-4-T-6-C 

and F-7-T-3-C with the fluorescence signatures 
corresponding to tags 9 and 10 have, been constructed 
as shown in Figure 2. Shown are the 

ultraviolet/visible absorption spectrum of F-4-T-6-C 

10 (Figure 2B) as well as the fluorescence emission 

spectra for F-4-T-6-C and F-7-T-3-C (Figures 2C and 
2D), with excitation at 488 nm (Ix Tris-Borate- 
Ethylenediaminetetraacetic acid (TBE) solution) . The 
UV/visible spectrum exhibits the characteristic 

15 absorption of FAM at 4 95 nm, TAM at 555 nm and Cy5 at 

649 nm (Figure 2B) . The fluorescence emission 

spectrum of F-4-T-6-C displays a fluorescence 
"signature with Cy5 highest, TAM next and FAM lowest; 
whereas F-7-T-3-C displays a fluorescence signature 

20 with FAM highest, TAM next and Cy5 lowest. The two 

fluorescence signatures are clearly different, and 
easily discernible by spectroscopic methods. Here 
the feasibility of the CFET approach involving three 
different dyes is clearly demonstrated. 

25 

It is evident that one can synthesize broad families 
of CFET tags. Examples of two synthetic approaches 
for constructing CFET tags are shown: (1) l',2'- 
dideoxyribose phosphate monomer can be used as a 
30 spacer to separate dyes used for labeling 

oligonucleotide primers, which can be assembled on a 
DNA synthesizer; (2) a rigid peptide linker can be 
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used to construct a CFET cassette to label any other 
molecular targets . 

The first example is shown in Figure 3A. A polymer 
5 linker (SSS...SSSS) formed by 1' , 2' -dideoxyribose 

phosphates (S) at the 5' end of the desired primer 
sequence forms a universal spacer for attaching the 
ET-coupled f luorophores , thereby producing an ET 
cassette. The 1 2 1 -dideoxyribose phosphates can be 

10 introduced using 5 ' -dimethoxytrityl-1 1 , 2 1 - 

dideoxyribose-3 1 - [ (2-cyanoethyl ) - (N, N-diisopropyl ) ] - 
phosphoramidite (dSpacer CE Phosphoramidite, Glen 
Research, Sterling, VA) . dSpacer CE Phosphoramidite 
has previously been used to construct DNA sequencing 

15 primers (Ju et al . , 1996). In this CFET tag 

construction, FAM is used as a common donor. In a 
CFET tag consisting of two different fluorescent 
dyes, either TAM or Cy5 can be used as acceptors ; x 
whereas in a CFET tag consisting of three different 

20 fluorescent dyes, TAM can also be used as a donor for 

Cy5. The length of the spacing between each 

donor/acceptor pair can be changed systematically to 
achieve the expected fluorescence signatures as shown 
in Table 1. FAM and TAM can be introduced using 

25 phosphoramidite FAM-dT and TAM-dT and Cy5 can be 

introduced to the modified T carrying an amino linker 
as described above. The use of such spacers is 
advantageous in several aspects: (i) the spacer will 
not hybridize to any sequences within the DNA 

30 template and therefore false priming is avoided; (ii) 

the linkage of the spacer maintains the natural 
nucleic acid phosphate functionality, which avoids 
possible anomalies in electrophoretic mobility; and 
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(iii) the elimination of the aromatic base groups on 
the deoxyribose rings in the spacer may reduce the 
likelihood of fluorescence quenching. 

5 The second synthetic approach requires sophisticated 

selective synthetic chemistry procedures for the CFET 
tag construction. As an example, Figure 3B shows a 
general scheme for the construction of CFET- 
deoxyuridine triphosphate (dUTP) using poly-proline 

10 (P) peptide as a spacer. The spacing between each 

donor/acceptor pair can be changed systematically to 
achieve the expected fluorescence signatures as shown 
in Table 1- Figure 4 shows a scheme for the 
synthesis of CFET-dUTP consisting of Fam, Tarn and 

15 Cy5. Peptide synthesis procedure using tert- 

butylcarbonyl (t-Boc) chemistry * is employed on a 
peptide synthesizer to construct the scaffold of the 
desired molecules. Starting with a glycine-resin as 
C-terminal, a modified proline tagged with FAM (Fam- 

20 proline) is coupled to glycine, then proline monomers 

are added, followed by reacting with another modified 
proline that has a protected primary amino linker 
(TFA-NH-proline) for the subsequent incorporation of 
Tarn. Next, proline spacer is again added, followed 

25 by reacting with the azido-proline for the subsequent 

incorporation of Cy5 . After cleavage from the resin 
and removal of the trif luoroacetyl group, compound 1 
in Figure 4 is obtained. Compound 1 reacts with TAM- 
NHS ester to form compound 2, which will then react 

30 with Cy5-phosphine (3) to produce compound 4, which 

has all the three dyes incorporated. Cy5-phosphine 
(3) can be synthesized using the modified Staudinger 
reaction developed by Bertozzi (Saxon and Bertozzi, 
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2000) . Conversion of compound 4 to an NHS ester 
produces 5, which is then coupled to Aminoallyl (AA) - 
dUTP (Sigma) to generate the final product CFET-dUTP • 
By varying the number of proline spacers between Fam 
5 and Tam, and between Tarn and Cy5, a library of CFET- 

dUTPs with unique fluorescence signatures can be 
developed. The intermediates 2, 4, 5, and the final 
products can be purified by high pressure liquid 
chromatography (HPLC) , size exclusion chromatography 
10 and gel electrophoresis. The structures of AA-dUTP, 

Fam-Proline, and NHS esters of TAM and Cy5 are shown 
in Figure 5. Brief synthetic schemes for the 
synthesis of trif luoroacetic (TFA) - NH-proline, Fam- 
proline, azido-proline and Cy5-phosphine are shown in 
15 Figure 6. Unique fluorescence signatures for 8 

synthesized CFET tags are shown in Figure 7. 

II. Biomedical Applications of Combinatorial 
Fluorescence Energy Transfer Tags 

The ability to sequence DNA accurately and rapidly is 
revolutionizing biology and medicine. The confluence 
of the massive Human Genome Project is driving an 
exponential growth in the development of high 
throughput genetic analysis technologies. This rapid 
technological development involving biology, 
chemistry, computer science, and engineering makes it 
possible to move from studying genes one by one to 
approaches which can analyze and compare entire 
genomes . 

Sophisticated techniques have enabled large-scale 
dissection of genomes. For instance, the development 
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of cloning vectors which can maintain and reproduce 
large stretches of DNA (up to a million bases) has 
resulted in clone libraries which span most of the 
chromosomes from end to end for many of the highly 
5 studied organisms including humans - so-called 

physical maps. Recognizing sequence markers that 
differ from one individual to another across the 
human genome has permitted them to be followed in 
families that harbor genetic diseases. If a marker 
cosegregates with the disease phenotype, one can be 
assured that the marker is in the vicinity of the 
gene responsible for that disease. Automated 
sequencing methods have made it possible to obtain 
the complete chemical composition of the genome with 
unprecedented speed, and computational approaches are 
beginning to allow annotation of these sequences, 
identification of the genes and other elements that 
comprise the chromosomes- Gene expression has moved 
from the arena of analyzing a few genes at a time by 
the techniques of Northern blot analysis, to creating 
vast microarrays of these genes on glass slides or 
silicon chips {Schena et al . 1995, Chee et al . 1996). 
Methods for identifying single nucleotide 

polymorphisms (SNPs) (Chen and Kwok, 1997), DNA- 
protein and protein-protein interactions (Uetz et al . 
2000), and members of metabolic, signal transduction 
and other pathways are also being developed. All 
these advances will have the potential to 
revolutionize medical and clinical research in 
establishing diagnostic, prognostic or treatment 
options . 
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It is noteworthy that many of the genomic techniques 
mentioned have benefited from the use of novel 
molecular tags, especially fluorescent dye molecules. 
DNA sequencing serves as a good example for 
5 evaluating the impact of this technology. Although 

the ability to obtain DNA sequences originated in the 
late 1970 f s with the development of the chemical 
cleavage approach of Maxam and Gilbert (1977) and the 
dideoxynucleotide terminator approach of Sanger et 

10 al. (1977), it was the latter that was most amenable 

to automation and fluorescent labeling strategies. 
In the past 15 years, in rapid succession, the 
ability to use four dyes in a single sequencing lane, 
one for each of the four bases in DNA (Smith et al. 

15 1986), the ability to use cycle sequencing with heat 

stable enzymes (Tabor et al. 1995), the development 
of energy transfer dyes which produced higher signals 
(Ju et al., 1995; Lee et al . , 1997), and more 
recently, the ability to obtain long sequence reads 

20 in separate capillary tubes instead of adjacent lanes 

on polyacrylamide slab gels, has made sequencing 
increasingly robust. Future improvements in 

sequencing technology, including miniaturization and 
solid phase approaches, will continue to take 

25 advantage of energy transfer (ET) and other novel 

fluorescent tags (Ju et al . , 1997). Investigators 
are also utilizing ET dyes for investigating gene 
expression on microarrays (Hacia et al. 1998). All 
of these approaches are believed to be limited to 

30 single pairs of donor and acceptor dyes for each 

reaction. The CFET approach described herein whereby 
one, two. or more dyes, disposed at varying molecular 
distances from each other to generate many 
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alternative discrete signatures offers the 
possibility of obtaining an order of magnitude higher 
throughput in many of these genomic approaches. 
Genetic mutation and chromosome analysis are two 
5 examples of the biomedical application of these CFET 

tags. Using CFET tags in combination with single 
fluorophore tags, and/or multiple dye tags where no 
FET occurs, the number of possible unique 
fluorescence signatures, and hence the number of e.g. 
10 SNPs detectable simultaneously, is hugely increased. 

Gene mutations play important roles in the 
development of many human diseases. It has become 
increasingly apparent that missense mutations (single 

15 base changes usually culminating in amino acid 

changes or introduction of stop codons which lead to 
truncated proteins) , microdeletions and 

microinsertions (both of which can change the reading 
frame and also usually lead to protein truncation) 

20 can occur at many positions along the length of the 

responsible gene. A number of studies have sought to 
identify causative mutations and predisposing 
polymorphisms for a number of cancers and other 
diseases. These include chronic lymphocytic leukemia 

25 and other blood cancers (Kalachikov et al. 1997; Qu 

et al. 1998), the long QT syndrome (an ionic 
disturbance in the heart visible on 

electrocardiograms and an important risk factor for 
sudden cardiac death), breast cancer (Fischer et al . 

30 1996), the rare ICF syndrome (immune 

def iciency/centromeric instability/facial anomalies ) 
(Xu et al. 1999), and more recently such complex 
disorders such as asthma and diabetes. 
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With the exception of the types of small mutations 
described above and single nucleotide polymorphisms 
that occur, on average, every 1000 nucleotides, the 6 
5 billion nucleotide pairs that make up the diploid 

human genome are largely identical from individual to 
individual. Nonetheless, large deletions, 

amplifications and rearrangements do occur, and such 
chromosomal anomalies are often associated with 

10 serious and life-threatening diseases. The best 

known example is probably the third copy of 
chromosome 21 in individuals with Down syndrome, but 
many other chromosomal translocations and 
macrodeletions are associated with cancer and other 

15 disease syndromes. If one is able to mark the 

positions along chromosomes with identifiable "color- 
coded" probes, it should be possible to easily detect 
such large-scale changes in chromosomal geography. 
In fact, the field of chromosome painting (multicolor 

20 fluorescence In situ hybridization (M-FISH) has been 

used for just such analyses (Speicher et al . 1996). A 
larger set of more readily separable CFET tag 
signatures might greatly aid in this enterprise. The 
established chromosome painting techniques require 

25 appropriate mixing of the different dyes prior to 

labeling, and so are used almost exclusively for 
labeling whole chromosomes. 

III. CFET Tags for Multiplex Gene Mutation Detection 
3 0 Using Ligase Chain Reaction 

Ligase chain reaction (LCR) is a procedure for 
genetic mutation analysis using ligase and a pair of 
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oligonucleotides (Eggerding, 1995; Wu and Wallace, 
1989; Landegren et al . , 1988). Briefly, it is based 
on the fact that two adjacent oligonucleotides can 
only be ligated if the' adjoining bases are 
5 complementary to the template strand. If there is a 

single base difference within two bases of the join 
site, ligation will not occur. Pairs of 

oligonucleotides are designed spanning the ligation 
site on the template DNA, including one harboring 

10 either the wild-type or mutated base. In the usual 

procedure, one of the oligonucleotides is 
radiolabeled at the phosphate group at its 5' end. 
Following the ligase chain reaction, which involves 
multiple rounds of denaturing, primer annealing and 

15 ligation, one can separate the products from the 

substrates on polyacrylamide gels. The procedure can 
be modified using single stranded DNA template as 
shown in Figure 8 for testing using the CFET tags. 
Primer pairs are generated surrounding a base that 

20 can be mutated. For example, the template may 

contain a T (wild-type, wt) or C (mutated, mut) at 
the relevant position. The wt primers are 
complementary to the wt template at every position. 
The primer on the right side of Figure 8A is labeled 

25 with CFET tag 1 to yield a specific fluorescent 

signature. The mutation-specific primer, two bases 
longer than its wild-type analog, is complementary to 
every position of the mutated template. This primer 
is labeled with CFET tag 2 displaying another unique 

30 fluorescent signature. A common 20 base pair primer 

will be used on the other side of the ligation site. 
In cases where ligation does not occur, because a 
wild-type oligonucleotide was used with a mutated 
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template sequence, or a mutated oligonucleotide was 
used with a wild-type template sequence, the only 
fluorescent band on the acrylamide gel will be the 
size of the tagged primer. In contrast, if there is 
5 no mismatch at the ligation junction, two fluorescent 

bands, one the size of the primer and one the size of 
the joined primers will form- Following ligase chain 
reaction, the left and right primer will be ligated 
only if they are completely complementary to the 

10 template. Thus, with a wt template, only a 40 base 

product will result, and only a 42 base product will 
result from a mut template as shown in Figure 8B. By 
virtue of the unique fluorescence emission signatures 
of the CFET tags, it is possible to display the 

15 products of several mutation positions 

simultaneously, each labeled with a different CFET 
Tag. The ligated products can be separated and 
analyzed in a single gel lane. In order to 

accomplish this, the multiplex set of 

20 oligonucleotides that contain the potentially mutated 

position can be 5' -end labeled, each with a specific 
CFET tag. For example, one can test four different 
mutation sites using eight distinct CFET tags. 

25 As shown in Table 2, eight primers labeled with eight 

unique CFET tags (1, 2, 3, 4, 5, 6, 9, and 10 of 
Table 1) can be constructed as shown in the general 
labeling scheme in Figure 3A using 1' , 2' -dideoxysugar 
phosphate (S) as spacers. For this set of CFET tag 

30 constructs, FAM is used as a common donor, and TAM 

and/or Cy5 as acceptors. The length of the spacing 
between each donor/acceptor pair, (S) m and (S) n/ can 
be changed systematically to achieve the expected 
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fluorescence signatures as depicted in Table 1. FAM 
and TAM can be introduced using FAM-dT and TAM-dT 
phosphoramidites and Cy5 can be introduced to the 
modified T carrying an amino linker as described 
5 above. 

The system can be tested, for example, by 
synthesizing single stranded DNA templates mimicking 
known single base mutations in exon 20 of the 

10 retinoblastoma susceptibility (RBI) gene (Schubert et 

al. 1994, Lohmann 1999). The sequences of two sets 
of synthetic templates (wt and mut) which can be used 
in the analysis are shown in Table 3. The sequence 
of the potential mutation positions is shown in bold- 

15 face as "A' , "C", "G" and M T". Primer sets 1 and 2 

in Table 2 are used for the testing of both wild type 
and mutated base positions of Template A, 
respectively; while primer sets 3 and 4 are for 
testing both wild type and mutated base positions of 

20 Template B, respectively. To maximize the number of 

samples that can be detected on a polyacrylamide gel, 
the primers surrounding each "mutated" position can 
be designed to be a unique length as shown in Figure 
9. For example, the two CFET labeled oligonucleotides 

25 (one for the wild-type gene and one for the mutated 

gene) surrounding mutation position 1 are 20 and 22 
bases long, respectively, and the unlabeled common 
primer is 20 bases long. Any resulting ligation 
product will be either 40 or 42 bases long. 

30 Likewise, for mutation position 2, 24 and 26 base 

labeled oligonucleotides can be constructed, as well 
as a different 20 base common primer, leading to 
ligation products of either 44 or 46 bases. More 
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primers can, of course, be generated by making the 
sizing increment one base instead of two bases for 
each different mutation, or creating a second set of 
labeled primers whose ligation products run between 
5 80 and 98 base pairs, between 120 and 138 base pairs, 

etc. Since single base pair resolution up to the 
length of ~ 400 bp DNA fragments is easily achieved 
in polyacrylamide gel electrophoresis, the ligated 
products can be readily resolved in such standard 

10 fluorescent gel systems. Furthermore, the advantage 

of being able to clearly distinguish the products 
based on their fluorescent signatures, as well as 
size, makes this assay extremely powerful. Expected 
gel electrophoresis results for this multiplex 

15 testing system are shown on the right side of Figure 

9. Here, template collection 1 is seen to contain 
only wt sequences. In contrast, template pool 2 
contains one template with a mutation at position 2 
and a heterozygote genotype at position 4. 
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Table 2. Eight primers used for multiplex mutation 
detection 



5 Primer 1L: 3 1 -ttaaaaagaataagggtgtc-5 1 

(SEQ ID NO: 2) 
Primer 1R wt: 3 ■ -Acatagccgatcggatagag-5 1 -CFET1 

(SEQ ID NO: 3) 
Primer 1R mut : 3 1 -Tcatagccgatcggatagaggc-5 1 -CFET2 
10 (SEQ ID NO: 4) 

Primer 2L: 3 1 -acatagccgatcggatagag-5 1 

(SEQ ID NO: 5) 
Primer 2R wt : 3 1 -Gccgatttatgtgaaacacttgcg-5 1 -CFET3 
15 (SEQ ID NO: 6) 

Primer 2R mut: 3 1 -Accgatttatgtgaaacacttgcgga-5 1 -CFET4 

(SEQ ID NO: 7) 

Primer 3L : 3 1 -cggaagacagactcgtgggt-5 1 

20 (SEQ ID NO: 8) 

Primer 3R wt : 3 f -Cttaatcttgtatagtagacctgggaaa-5 1 -CFET5 

(SEQ ID NO: 9) 

Primer 3R mut: 3 1 -Attaatcttgtatagtagacctgggaaaag-5 ' -CFET6 

(SEQ ID NO: 10) 

25 

Primer 4L : 3 1 -atagtagacctgggaaaagg-5 1 

(SEQ ID NO: 11) 

Primer 4R wt : 3 1 -Tcgtgtgggacgtcttactcatacttgagt-5 ' -CFET9 

(SEQ ID NO: 12) 

30 Primer 4R mut: 3 • -Gcgtgtgggacgtcttactcatacttgagtac-5 ' CFET10 

(SEQ ID NO: 13) 
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Table 3. The sequence of the two sets of synthetic 
templates (wt and mut) 

5 Template A: 

5 » -gtaaaaatgactaatttttcttattcccacagTgtatcggctagcctatc 
tcCggctaaatacactttgtgaacgccttctgtctgagcacccagaatta-3 ? 
(wild type) (SEQ ID NO: 14) 

10 5 1 -gtaaaaatgactaatttttcttattcccacagAgtatcggctagcctatc 

tcTggctaaatacactttgtgaacgccttctgtctgagcacccagaatta-3 1 
(mutated) (SEQ ID NO: 15) 



Template B: 

!5 5 ' -tacactttgtgaacgccttctgtctgagcacccaGaattagaacatatca 

tctggacccttttccAgcacaccctgcagaatgagtatgaactcatgaga-3 1 
(wild type) (SEQ ID NO: 16) 

5 » -tacactttgtgaacgccttctgtctgagcacccaTaattagaacatatca 
2 0 tctggacccttttccCgcacaccctgcagaatgagtatgaactcatgaga-3 1 

(mutated) (SEQ ID NO: 17) 
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IV. CFET Tag Labeled Probes for Chromosome-wide 
Analysis 

5 Probes can be generated using a random primed 

labeling method to incorporate CFET-dUTP into 
chromosome-specific DNA molecules or cosmids disposed 
along the length of a given chromosome. Metaphase 
spreads of fresh cells or deparaf f inized material can 

10 be prepared by standard methodologies, and the tagged 

probes can be hybridized to the chromosomes. Bulky 
ET dyes consisting of two individual fluorescent 
molecules, as well as dyes with a long linker, have 
been attached to deoxynucleotides (dNTPs) and 

15 dideoxynucleotides (ddNTPs) which have been shown to 

be good substrates for DNA polymerase (Rosenblum et 
al. 1997, Zhu et al . 1994). Thus, the CFET-dUTP 
should be able to be incorporated into the growing 
strand by the polymerase reaction. In the actual 

20 random priming reaction, the ratio of regular 

deoxythymine triphosphate (dTTP) and CFET-dUTP can be 
adjusted, so that only a small portion of CFET-dUTP 
will be incorporated into the growing chain, just 
enough to be detected by the optical method. 



25 



30 



Numerical and structural chromosome rearrangements 
are a major cause of human mortality and morbidity. 
Aneuploidy of whole chromosomes accounts for at least 
50% of early embryonic lethality, and also leads to 
severe patterns of congenital malformation such as 
Down syndrome. Segmental aneuploidies due to 

deletions and duplications also lead to malformation 
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syndromes, as well as being associated with many 
types of cancer. 

Traditional cytogenetic analysis is hampered by 
5 problems of resolution and interpretation inherent in 

standard banding analysis. In the last decade the use 
of fluorescent labeled DNA probes on chromosome 
preparations as well as on interphase nuclei has 
greatly improved the resolution and accuracy of 

10 cytogenetic diagnosis. Microdeletions and 

amplifications too small to be visible under the 
light microscope by banding can now be visualized 
using chromosome and region specific f luorescently 
labeled probes. Multiplexing this system is possible 

15 using combinations of probes labeled with different 

fluors. Sets of up to five differently labeled probes 
have been used for diagnostic purposes on interphase 
nuclei to determine aneuploidy in prenatal samples 
(Munne et al . 1998). M-FISH and Spectral Karyotyping 

20 use a combinatorial approach of five dyes to "paint" 

all 23 pairs of human chromosomes so they can be 
distinguished using computerized image software 
(Schrock et al. 1996, Speicher et al. 1996). 

However, these established techniques require careful 

25 mixing of dyes in controlled ratios. Quality control 

is often a problem, and the commercially available 
probes are very expensive. 

CFET Tags are expected to have a substantial 
30 advantage over currently available dye sets. It 

should be possible to generate a larger number of 
CFET tag sets, reducing the need for a combinatorial 
approach. Quality control is also likely to be 
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easier, since each probe needs to be labeled with 
only one tag, and probe sets can be mixed in equal 
quantities to produce multicolor FISH reagents. 

5 CFET Tags for example could be used both for the 

detection of aneuploidy in interphase nuclei, and for 
the detection of submicroscopic chromosomal deletions 
and amplifications- For aneuploidy detection, for 
example, a set of eight different CFET tag labeled 
10 probes can be prepared, each specific for one of the 

chromosomes most commonly involved in aneuploidy in 
either embryonic losses or birth defects (chromosomes 
13, 15, 16, 18, 21, 22, X and Y) . 

15 A schematic of a procedure for comprehensive 

chromosome-wide analysis for gain or loss of genetic 
material is shown in Figure 10. In the example, 
eight probes each labeled with a CFET-dUTP that emits 
a unique fluorescence signature are hybridized along 

20 a chromosome in eight separate locations. The normal 

chromosome A will display eight unique fluorescence 
signatures of each probe in a defined order. A loss 
of fluorescence signature "2" in chromosome B will 
indicate the deletion of the complementary sequence 

25 of probe 2. Whereas, in chromosome C, the appearance 

of two signatures of M 3" will indicate the expansion 
of the complementary sequences for probe 3. 

Standard sets of cosmid and BisAcryloylCystamine 
30 (BAC) markers at 2-3 Mb intervals along the 

chromosomes are being developed in several 
laboratories, including a National Cancer Institute 
sponsored project, the Cancer Chromosome Aberration 
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Project (CCAP: webpage 

www . ncbi . nlm. nih ■ qov/ncicqap / ) ■ Sets of 

differentially CFET-labeled ordered probes specific 
for particular chromosomal regions can be prepared. 
5 Using FISH, one can then determine the limits of 

suspected or known deletions . 

V. Use of CFET Tags In Other Multi-Component 
Analyses 

10 

The CFET tags with unique fluorescence signatures 
which are disclosed in the present application will 
have utility in other applications involving multi 
component analysis in addition to those disclosed 

15 above. Additional applications include, but are not 

limited to, multiplex assays including binding assays 
and immuno assays, detection of microbial pathogens, 
monitoring multiple biomolecular reactions, screening 
of drugs or compounds, epitope mapping, allergy 

20 screening, and use with organic compounds and in 

material science. For example, multiple reactions or 
interactions can be measured simultaneously, where 
multiple CFET tags, each with a different 
fluorescence signature, are used to label the 

25 different reactants which could include, for example, 

antibodies, antigens, ligands, or substrates. 
Examples include antibody-antigen and receptor-ligand 
binding. In further examples, different reactants 
can be coupled to microspheres. 
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VI. CFET Tags Used in Ligation Assay to Identify 
Multiple Single Nucleotide Polymorphisms . 

As an example of application for biological assays, 
5 the CFET tags were applied to an oligonucleotide 

ligation assay (Landegren, 1988) coupled with solid 
phase purification to detect genetic mutations on 
exon 20 of the tumor suppressor retinoblastoma (RBI) 
gene. The schematic of the approach is shown in Fig. 

10 11. Two 20 base-pair oligonucleotides, one. labeled 

with a CFET tag at the 5' end and the other labeled 
with a biotin at the 3' end and a monophosphate (P) 
group at the 5' end, are hybridized to the target DNA 
template such that the 3 % end of the CFET-labeled 

15 oligonucleotide is positioned next to the 5' end of 

the biotinylated oligonucleotide. Taq DNA ligase 
joins the two juxtaposed oligonucleotides in a head- 
to-tail fashion by forming a phosphodiester bond, 
provided that the nucleotides at the ligating 

20 junction of the two oligonucleotides are correctly 

base-paired with the template (Barany, 1991) . Under 
the experimental conditions using Taq DNA ligase, no 
ligation reaction occurs when there is a mismatch 
between the 3' end of the CFET-labeled probe 

25 (nucleotides A and C, Fig. 11) and the SNP site 

(nucleotides T and G, Fig. 11) on the target 
template. After the ligation, the CFET-labeled 

ligation products (40 base-pair) are immobilized to 
streptavidin-coat-ed magnetic beads while the other 

30 components are washed away. The ligation products are 

then cleaved from the magnetic beads by denaturing 
the biotin-streptayidin interaction with formamide 
and analyzed with a three-color fluorescence CAE 
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system. The CFET-labeled ligation products are 
unambiguously detected due to their distinct mobility 
and unique fluorescence signatures in the 
electropherogram, see Figure 12. In the case of 
5 heterozygotes at the SNP site, two CFET tags with 

different fluorescence signature and electrophoretic 
mobility are used to label the oligonucleotides 
corresponding to each allele. The unique fluorescence 
signatures in the electropherogram thus identify each 

10 of the corresponding SNPs . The solid phase procedure 

completely eliminates the unligated CFET-labeled 
oligonucleotide. Although the unligated 20 base-pair 
biotinylated oligonucleotides are also captured by 
the magnetic beads, they do not produce fluorescence 

15 signals due to the absence of CFET tags. The CFET tag 

library in this application detects multiple SNPs on 
the target DNA template simultaneously. 

Exon 20 of the tumor suppressor RBI gene (Schubert, 
20 1994) was selected as a model system to test - the 

utility of the CFET tags. Several SNPs within a 
region of 200 base pairs in the RBI gene have been 
found, which are well suited for evaluating a genetic 
mutation analysis system. Six ligation reactions were 
25 carried out separately using six different CFET tags 

on synthetic templates mimicking exon 20 of the RBI 
gene where multiple SNPs (six nucleotide variations) 
are located. After the ligation and solid phase 
purification, the ligation products were combined in 
30 a single tube and analyzed with a three-color CAE 

system, resulting in the simultaneous detection of 
six nucleotide variations by the unique fluorescence 
signatures of the CFET-labeled ligation products (see 
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Figure 12A) . The unique fluorescence signatures were 
spatially resolved in the electropherogram as a 
result of the different mobility of the CFET-labeled 
ligation products. In this model experiment, both 
5 CFET-1 (FAM) and CFET-6 (F-10-Cy5) detect homozygous 

SNPs (T/T) . CFET-3 (F-9-T) and CFET-4 (F-13-T) 
clearly distinguish a mimic of RBI gene mutation 
R661W (amino acid change from arginine to tryptophan 
due to mutation in codon 661) by detecting both the 

10 wild type (C) and " the mutation (T) . CFET-7 (F-4-T-6- 

Cy5) and CFET-8 (F-7-T~7-Cy5) identify another 
mutation Q685P (amino acid change from glutamine to 
proline due to mutation in codon 685) with 
heterozygous genotype (A/C) . To validate the CFET 

15 technology further used three CFET-labeled 

oligonucleotide probes (CFET-1, 3 and 7) and their 
corresponding biotinylated oligonucleotides to 
identify three SNPs using a PCR product amplified 
from exon 20 of the RBI gene from patient genomic 

20 DNA . The ligation reactions were performed in a 

single tube and the reaction products were loaded 
onto a three-color CAE system. Three individual 
homozygous SNPs (T, C and A) , that were verified by 
DNA sequencing, were unambiguously identified by the 

25 three distinct fluorescence signatures from the CFET 

tags (figure 12B) : T (FAM, CFET-1), C (F-9-T, CFET-3) 
and A (F-4-T-6-Cy5, CFET-7) . Thus, the approach 
described here can detect both heterozygotes and 
homozygotes unambiguously because of the unique CFET 

30 fluorescence signature and mobility in the 

electropherogram. 
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To increase the level of control available in 
isolation other isolation-permitting moieties besides 
biotin may be employed such as phenylboronic acid. 
Attachment of the moieties via cleavable linker 
5 molecules enhances this still further. 

VII . CFET Tags Used in Single Base Primer Extension 
to Identify Multiple Single Nucleotide Polymorphisms. 

Single base extension for each dye-labeled primer was 
done by mixing 0.5 to 1 pmol of the primers with 1 
pmol of template, followed by adding 2 pi of thermo 
sequenase 10X reaction buffer (260 mM Tris-HCl, 65 mM 
MgCl 2 / pH 9.5, Amersham Pharmacia Biotech, Piscataway, 
NJ) , 5 pi of water, 1 pmol of biotinylated 
dideoxynucleoside triphosphates (Biotin-ll-ddNTP, 
NEN, Boston, MA) and 1 unit of thermo sequenase in 20 
mM Tris-HCl, pH 8.5, 50% glycerol, 0.1 mM 
ethylenediamine tetraacetic acid (EDTA) , 0.5% Tween™- 
20 (v/v) , 0.5% NonidetTM P-40 (v/v) , ImM 
dithiothreitol (DTT) , 100 mM KC1 and 0.053 unit/pl 
Thermoplasma acidophilum inorganic pyrophosphatase 
(Amersham Pharmacia Biotech) . The reaction mixture 
was incubated at 54 °C for 30 sec for single base 
extension . 

Schematic representation of the multiplex SNPs 
detection using CFET tags and biotinylated 
dideoxynucleotides is shown, in Figure 13. In this 
30 example, extension of the primers are initiated by 

ddCTP-Biotin (for primer 1) and ddGTP-Biotin (for 
primer 2) in the presence of DNA polymerase if there 
is a match between the 3' end of the primer and the 
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template (X and Y for primer 1; X' and Y f for primer 
2) . The extension products are isolated using 
streptavidin-coated magnetic beads. Upon denaturing, 
washing and releasing from the beads, the extension 
5 products are loaded onto an electrophoresis system 

and the. resulting fluorescence signatures from the 
electropherogram identify each of the unique SNPs . 
Thus, the CFET-labeled oligonucleotides, DNA 
polymerase and biotinylated dideoxynucleotides form a 

10 high fidelity SNP detection system in which the base 

at the 3' end of the oligonucleotides dictates its 
extension by incorporating a specific biotinylated 
dideoxynucleotide. The CFET tags used were F, F-9-T 
and F-13-T. Their unique fluorescence signatures are 

15 shown in Figures 14 and 15 

To increase the level of control over isolation, 
other isolation-permitting moieties such as 
phenylboronic acid, antigens or antibodies may be 
20 employed in place of the biotin. Attachment of the 

moieties via cleavable linker molecules enhances this 
still further. 



25 VIII. High Throughput Analyses. 

The throughput of the multiplex analyses offered by 
the use of the CFET tags can be increased by 
performing the analyses in the high throughput 
chamber illustrated in figure 16. 

30 

IX. In combination with non-FET tags. 

To increase the number of different unique 
fluorescent signatures available in any set of tags 
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CFET tags can be used in combination with single 
chromophore/f luorophore tags and tags with multiple 
chromophores/fluorophores where no FET occurs. The 
number of possible different fluorescence signatures 
using such combinations is huge, and would greatly 
aid multiplex analyses. Such fluorophores could be 
quantum dots, luminescent molecules of fluorescent 
dyes. For example, each tag could be used to detect a 
different SNP using the exemplified assays. 



10 
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What is claimed is: 



1 . A composition of matter comprising multiple 
fluorophores, each of which is bound to a 
5 molecular scaffold at a separate predetermined 

position on the scaffold, such separate 
predetermined positions being selected so as to 
permit fluorescence energy transfer between one 
such fluorophore and another such fluorophore, 
10 wherein the one such fluorophore and the another 

such fluorophore are characterized by the maximum 
emission wavelength of one being greater than the 
minimum excitation wavelength of the other. 



15 2 . A composition of matter of claim 1 comprising two 

f luorophores, each of which is bound to a 
molecular scaffold, at a separate predetermined 
position on the scaffold, such separate positions 
being selected so as to permit fluorescence 

20 energy transfer between such f luorophores , and 

such fluorophores being characterized by the 
maximum emission wavelength of one of the 
fluorophores being greater than the minimum 
excitation wavelength of the other fluorophore. 

25 

3. A composition of matter of claim 1 comprising 
three fluorophores each of which is bound to a 
molecular scaffold at a separate predetermined 
position on the scaffold, such separate 
30 predetermined positions being selected so as to 

permit fluorescence energy transfer among such 
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fluorophores and such fluorophores being 
characterized by the maximum emission wavelength 
of one such fluorophore being greater than the 
minimum excitation wavelength of the second such 
5 fluorophore and the maximum emission wavelength 

of such second fluorophore being greater than the 
minimum excitation wavelength of the third such 
fluorophore . 

10 4. The composition of matter of. the claim 1/ wherein 

each fluorophore is covalently bound to the 
molecular scaffold . 

5. The composition of claim 1, wherein the 
15 efficiency of the fluorescence energy transfer is 

less than 20%. 

6. The composition of claim 1, wherein the molecular 
scaffold is rigid. 

20 

7. The composition of claim 1, wherein the molecular 
scaffold is polymeric. 

8. The composition of claim 9, wherein the molecular 
25 scaffold comprises a nucleic acid. 



9. The composition of claim 9, wherein the molecular 
scaffold comprises a peptide. 



30 



10. The composition of claim 9, wherein the molecular 
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scaffold comprises a polyphosphate. 

11. The composition of claim 1, wherein at least one 
fluorophore is a fluorescent dye. 



12. The composition of claim 11, wherein the 
fluorescent dye is 6-carboxyf luorescein . 

13. The composition of claim 11, wherein the 
10 fluorescent dye is N, N, N ■ , N ' -tetramethyl-6- 

carboxyrhodamine . 



15 



14. The composition of claim 11, wherein the 
fluorescent dye is cyanine-5 monof unctional dye. 

15. The composition of claim 11, wherein at least one 
fluorophore is a luminescent molecule. 



20 



16. The composition of claim 11, wherein at least one 
fluorophore is a quantum dot. 



17. A composition of matter having the structure: 



25 



FAM 



TAM 

r 



Q-T-(S) m _ T - R 
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wherein S represents a 1 ? , 2 1 -dideoxyribose 
phosphate moiety/ m is an integer greater than 1 
and less than 100, each T represents a thymidine 
5 derivative, FAM represents 6-carboxyf luorescein 

derivative, TAM represents N,N,N' ,N' -tetramethyl- 
6-carboxyrhodamine derivative, each solid line 
represents a covalent bond, R represents either a 
hydroxy or phosphate terminus and Q represents 
10 either a hydroxy or phosphate terminus, with the 

proviso that R and Q are different. 

18. The composition of claim 17, wherein m is 4 . 

15 19. The composition of claim 17, wherein m is 6. 

20. The composition of claim 17, wherein m is 9. 

21. The composition of claim 17, wherein m is 13. 

20 

22. A composition of matter having the structure: 



FAM 



Cy5 



2f 



Q-T — (S) m — T-R 
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10 



wherein S represents a 1 1 , 2 1 -dideoxyribose 
phosphate moiety, m is an integer greater than 1 
and less than 100, T represents a thymidine 
derivative, FAM represents a 6-carboxyf luorescein 
derivative, Cy5 represents a cyanine-5 
monof unctional dye derivative, each solid line 
represents a covalent bond, R represents' either a 
hydroxy or phosphate terminus and Q represents 
either a hydroxy or phosphate terminus, with the 
proviso that R and Q are different. 



15 



23. The composition of claim 22, wherein in is 4 



24. The composition of claim 22, wherein m is 5. 



25. The composition of claim 22, wherein m is 7 



26. The composition of claim 22, wherein m is 10, 



20 



27. The composition of claim 22, wherein m is 13, 



28. A composition of matter comprising the structure 
shown below: 



25 



FAM 



TAM 



Cy5 



Q- T-(S) m -T-(S) n -T_ R 
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wherein S represents a 1 1 , 2 1 -dideoxyribose 
phosphate moiety, m is an integer greater than 1 
and less than 100, n is an integer greater than 1 
5 and less than 100, T represents a thymidine 

derivative, FAM represents a 6-carboxyf luorescein 
derivative, Cy5 represents a cyanine-5 
monof unctional dye derivative, TAM represents a 
N,N,N' , N' -tetramethyl-6-carboxyrhodamine 
10 derivative, each solid line represents a covalent 

bond, R represents either a hydroxy or phosphate 
terminus and Q represents either a hydroxy or 
phosphate terminus, with the proviso that R and Q 
are different - 

15 

29. The composition of claim 28, wherein m is 3, and 
n is 7 . 



30. The composition of claim 28, wherein m is 4, and 
20 n is 6. 

31. The composition of claim 28, wherein m is 5, and 
n is 5 

25 32. The composition of claim 28, wherein m is 6, and 

n is 6 . 



33. 



The composition of claim 28, wherein m is 7, and 
n is 7 . 
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34 . A composition of matter comprising the structure 
shown below: 



TAM 



TAM 

Q ZT^(S) m -T-R 



10 



15 



20 



wherein S represents a 1 1 , 2 1 -dideoxyribose 
phosphate moiety, m represents an integer greater 
than 1 and less than 100, T represents a 
thymidine derivative, and TAM represents a 
N, N, N' , N' -tetramethyl-6-carboxyrhodamine 
derivative, each solid line represents a covalent 
bond, R represents either a hydroxy or phosphate 
terminus and Q represents either a hydroxy or 
phosphate terminus, with the proviso that R and Q 
are different. 



35. The composition of claim 34, wherein m is 4 



25 



36. A nucleic acid labeled with the composition of 
any of claims 1, 17, 22, 28 and 34. 



37. The nucleic acid of claim 36, wherein the nucleic 
acid is DNA. 
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38. The nucleic acid of claim 36, wherein the nucleic 
acid is RNA. 

5 39. The nucleic acid of claim 36, wherein the nucleic 

acid is DNA/RNA. 

40. A method of determining whether a preselected 
nucleotide residue is present at a predetermined 
10 position within a nucleic acid comprising the 

steps of: 

contacting the nucleic acid, under 
hybridizing and DNA ligation-permitting 
conditions, with (i) a DNA ligase, (ii) a 

15 first oligonucleotide having affixed 

thereto a composition of matter of claim 1 
wherein the first oligonucleotide 

hybridizes with nucleotides immediately 
adjacent one side of the predetermined 

20 position and (iii) a second oligonucleotide 

which hybridizes with the nucleotides 
immediately adjacent the other side of the 
predetermined position, wherein the 
hydroxy-terminal residue of the 

25 oligonucleotide which hybridizes to the 

nucleotide located 3' of the predetermined 
position is a nucleotide which is 
complementary to the preselected nucleotide 
residue; and 

30 (b) detecting the presence of a ligation 

product comprising both the first and the 
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second oligonucleotides , the presence of 
such a ligation product indicating the 
presence of the preselected nucleotide 
residue at the predetermined position. 

5 

41. A method of determining whether at various 
predetermined positions within a nucleic acid, a 
preselected nucleotide residue is present at such 
position, wherein the preselected nucleotide 

10 residue may vary at different predetermined 

positions which comprises determining whether 
each preselected nucleotide is present each 
predetermined position according to the method of 
claim 42. 

15 

42. The method of claim 41, wherein the presence of a 
plurality of given nucleotide residues is 
determined simultaneously. 

20 43. The method of claim 40, wherein the DNA ligase is 

Taq DNA ligase. 

44. The method of claim 40, wherein the second 
oligonucleotide has an isolation-permitting 
25 moiety affixed thereto, and wherein the method 

further comprises the steps of isolating the 
moiety-containing molecules resulting from step 
(a) and determining the presence therein of 
ligated first and second oligonucleotides. 



30 
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45- The method of claim 40, wherein the composition 
of matter affixed to the first oligonucleotide 
has a predetermined emission spectrum, and 
wherein the observation of this emission spectrum 
5 is employed to determine the presence of ligated 

first and second oligonucleotides in step (b) . . 

46. A method of determining whether a preselected 
nucleotide residue is present at a predetermined 
10 position * within a nucleic acid ' comprising the 

steps of: 

(a) contacting the nucleic acid, under 
hybridizing and DNA polymerization- 
permitting conditions, with (i) a DNA 

15 polymerase, (ii) an oligonucleotide (1) 

having affixed thereto a composition of 
matter of claim 1, and (2) having a 
hydroxyl 3* terminus thereof, wherein the 
oligonucleotide hybridizes with the 3' 

20 region of the nucleic acid molecule 

flanking the predetermined position, and 
(iii) a dideoxynucleotide labeled with an 
isolation-permitting moiety, wherein the 
labeled dideoxynucleotide is complementary 

25 to the given nucleotide residue, 

with the proviso that upon hybridization of 
the oligonucleotide with the nucleic acid 
in the presence of DNA polymerase and the 
preselected nucleotide residue, the 
30 oligonucleotide and dideoxynucleotide are 

juxtaposed so as to permit their covalent 
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linkage by the DNA polymerase; 

(b) detecting the presence of a polymerization 
product comprising both the oligonucleotide 
and the dideoxynucleotide, the presence of 
5 such a polymerization product indicating 

the presence of the preselected nucleotide 
residue at the predetermined position. 



47 . A method of determining whether at various 
10 predetermined positions within a nucleic acid, a 

preselected nucleotide residue is present at such 
position, wherein the preselected nucleotide 
residue may vary at different predetermined 
positions which comprises determining whether 
15 each preselected nucleotide is present each 

predetermined position according to the method of 
claim 46. 



48. The method of claim 4 6, wherein the DNA 
20 polymerase is thermo sequenase . 

49. The method of claim 4 6, wherein the 
dideoxynucleotide is selected from the group 
consisting of dideoxyadenosine triphosphate, 

25 dideoxycytidine triphosphate, dideoxyguanosine 

triphosphate, dideoxythymidine triphosphate, and 
dideoxyuridine triphosphate . 



30 



50. 



The method of claim 4 6, wherein the composition 
of matter affixed to the oligonucleotide has a 
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predetermined emission spectrum, and wherein the 
observation of this emission spectrum is employed 
to determine the presence of polymerization 
product in step (b) . 

5 

51. The method of claim 45 or 50, wherein observing 
the predetermined emission spectrum is performed 
using radiation having a wavelength of between 
200 and lOOOnm. 

10 

52. The method of claim 51, wherein the radiation has 
a wavelength of 488 nm. 



53. The method of claim 45 or 50 wherein observing 

15 the predetermined emission spectrum is performed 

using radiation having a bandwidth of between 1 
and 50nm. 



54. The method of claim 53, wherein the radiation 
20 bandwidth is lnm. 

55. The method of claim 44 or 46, wherein the 
isolation-permitting moiety comprises biotin, 
streptavidin, phenylboronic acid, 

25 salicylhydroxamic acid, an antibody or an 

antigen . 



56. 



The method of claim 55, wherein the isolation- 
permitting moiety is attached to the 
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oligonucleotide via a 

57. The method of claim 
permitting moiety 
5 dideoxynucleotide via 



linker molecule. 

4 6, wherein the isolation- 
is attached to the 
a linker molecule. 



58. The method of claim 56 or 57, wherein the linker 
molecule is chemically cleavable. 



10 59. The method of claim 56 or 57, wherein the linker 

molecule is photocleavable . 



15 



60. The method of claim 59, wherein the linker 
molecule has the structure: 
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SEQUENCE LISTING 

<110> Ju, Jingyue 

Russo, James J 
Tong, Anthony 
Li, Zengmin 

<120> Combinatorial Fluoresence Energy Transfer Tags And 

Their Applications For Multiplex Biological Analyses 

<130> 0575/62238A/JPW/ADM 

<140> 
<141> 

<150> 09/658,077 
<151> 2000-09-11 

<160> 17 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: scaffold 
<400> 1 

tttttttttt tttttttttt tttttc 26 



<210> 2 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 2 

ttaaaaagaa taagggtgtc 20 



<210> 3 
<211> 20 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 3 

acatagccga tcggatagag 



<210> 4 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 4 

tcatagccga tcggatagag gc 



<210> 5 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 5 

acatagccga tcggatagag 



<210> 6 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 6 

gccgatttat gtgaaacact tgcg 



<210> 7 
<211> 26 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 7 

accgatttat gtgaaacact tgcgga 



<210> 8 
<211> 20 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 8 

cggaagacag actcgtgggt 

<210> 9 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 9 

cttaatcttg tatagtagac ctgggaaa 



<210> 10 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 10 

attaatcttg tatagtagac ctgggaaaag 



<210> 11 
<211> 20 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 11 

atagtagacc tgggaaaagg 



<210> 12 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 12 

tcgtgtggga cgtcttactc atacttgagt 30 



<210> 13 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 13 

gcgtgtggga cgtcttactc atacttgagt ac 32 



<210> 14 
<211> 100 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: template 
<400> 14 

gtaaaaatga ctaatttttc ttattcccac agtgtatcgg ctagcctatc tccggctaaa 60 
tacactttgt gaacgccttc tgtctgagca cccagaatta 100 



<210> 15 
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<211> 100 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: template 
<400> 15 

gtaaaaatga ctaatttttc ttattcccac agagtatcgg ctagcctatc tctggctaaa 60 
tacactttgt gaacgccttc tgtctgagca cccagaatta 100 

<210> 16 
<211> 100 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: template 
<400> 16 

tacactttgt gaacgccttc tgtctgagca cccagaatta gaacatatca tctggaccct 60 
tttccagcac accctgcaga atgagtatga actcatgaga 100 



<210> 17 
<211> 100 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: template 
<400> 17 

tacactttgt gaacgccttc tgtctgagca cccataatta gaacatatca tctggaccct 60 
tttcccgcac accctgcaga atgagtatga actcatgaga 10 
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